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ABSTRACT 

The  7090  at  NYU  has  been  modified  to  Include  a  "Significance 
Mode"  of  operation  which  is  intended  to  facilitate  the  identi- 
fication of  significant  bits  in  the  results  of  floating  point 
arithmetic  operations.   The  manner  in  which  floating  point 
arithmetic  is  handled  in  this  mode  is  discussed.   Several  numeri- 
cal experiments  using  this  mode  are  described  and  comparisons  are 
made  with  the  ordinary  "normalized  mode."   Examples  Include  power 
series  evaluation,  linear  equations  solution,  determinant  evalu- 
ation and  matrix  Inversion.   The  main  conclusion  is  that  "Sig- 
nificance Mode"  can  be  quite  useful,  and  that  no  loss  of  accuracy 
is  encountered  with  it  as  long  as  a  few  "guard  bits"  can  be  assumed 
with  the  initial  data. 
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NYO-9190 
SIGNIFICANCE  ARITHMETIC  ON  THE  IBM  7090 

1 .    Introduction . 

Scientific  computations,  except  for  problems  in  which  no 
scaling  is  required  or  for  certain  specialized  problems  which 
deal  with  rational  numbers  only,  are  generally  handled  on 
machines  in  the  floating  point  notation.   This  notation  affords 
great  range  and  simplicity  of  programming. 

It  would  be  difficult  to  deny  the  efficacy  of  automatic 
floating  point  operations  as  a  means  for  eliminating  the 
rescallng  procedures  of  fixed  point  computing.   But  at  what 
price  has  this  convenience  been  bought?   In  the  era  of  fixed 
point  computing  we  knew  that  our  results  were  more  or  less 
justifiable   on  the  basis  of  all  reasonable  criteria  for 
judging  the  number  of  significant  places  in  a  result.   Now 
popular  normalized  floating  point  calculations  simply  avoid 
the  embarrassment  of  improperly  scaled  fixed  point  operations 
and  simultaneously  obscure  the  possibility  that  the  results 
may  be  nonsense,  because  non-significant  digits  have  been 
introduced  at  some  stages  of  the  computation.   It  would  seem 
that  automatic  procedures  could  be  useful  for  determining  whether 

cancellation  has  indeed  destroyed  the  usefulness  of  a  result. 

1  2 
Several  methods  have  been  suggested  '      for  possible 

Incorporation  into  the  hardware  of  computers  to  aid  in  the 

estimation  of  significance.   The  Maniac  III  computer  recently 
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completed  at  the  University  of  Chicago  and  the  George  computer 
being  built  at  the  Argonne  National  Laboratory  have  provision 
for  significance.   Stretch  (IBM  703O)  has  the  "noisy  mode," 
a  form  of  significance  indication. 

At  the  NYU  Computing  Center  a  7090  has  been  modified*  to 

facilitate  the  identification  of  significant  bits  using  a 

3  4 
method  '   which  avoids  some  of  the  difficulties  of  conventional 

floating  point  arithmetic  by  not  normalizing  the  numbers  used. 

2 .    Past  Experience. 

At  the  NYU  Computing  Center  we  had  already  experimented  '^ 
with  subroutines  to  control  significance  and  found  that  not  only 
was  such  care  justifiable  for  a  particular  problem  but,  even 
more,  it  was  crucial  to  one  method  for  computing  a  numerical 
solution.   Briefly,  the  problem  was  to  determine  the  flow  behind 
a  detached  shock  in  front  of  a  body  of  revolution.   If  one  assumes 
knowledge  of  the  position  of  the  shock,  then  it  is  possible  to 
express  the  flow  variables  in  a  double  power  series  whose  coef- 
ficients are  recursively  defined. 

f  =  f(x,y)  =  y^   r.^   xJ  y^ 

where   f  are  flow  variables  for  the  pressure,  density,  velocity 
components,  etc. 

An  analyst  would  then  be  satisfied,  since  these  power  series 
and  continuations  of  them  would  yield  all  the  flow  quantities 

IBM  RPQ  E02284. 
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immediately  behind  the  shock  and  even  describe  the  body  causing 
the  shock.   However  he  would  still  be  faced  with  the  practical 
problem  of  providing  a  numerical  description  of  the  flow  and 
body. 

Some  of  the  high  order  coefficients  were  so  inaccurate 
that  it  was  necessary  to  discard  them  in  order-  not  to  ruin  the 
sum,  a  difficulty  which  could  not  practically  be  taken  care  of 
without  some  monitor  of  the  accuracy  of  the  coefficients.   These 
computing  difficulties  were  dealt  with  by  R.  D.  Richtmyer'"'  and 
G.  Lewis   by  two  different  methods. 

Richtmyer  carried  a  significance  indicator  in  the  form  of 
an  extra  digit  in  his  normalized  floating  point  interpretive 
routines  on  the  Univac .   Later  Lewis  used  unnormalized  double 
precision  arithmetic  on  the  70^-*   Both  followed  the  same  rules 
for  significance  control  in  floating  point  operations. 

As  a  result  of  consideration  of  this  crucial  example  and 
a  general  belief  that  the  indication  of  significance  is  important, 
it  was  decided  to  request  the  modification  of  the  IBM  709O  to 
implement  such  indication. 

5 .    Discussion  of  Rules. 

The  most  feasible  way  to  incorporate  the  identification  of 
significant  bits  in  an  existing  machine,  the  IBM  7090,  we  found, 
was  to  introduce  a  new  mode  into  the  machine .   The  machine  would 
use  conventional  normalized  arithmetic  unless  this  "significance 
mode"  was  entered.   In  this  mode  the  standard  elementary 

See  Routines  NU  UDPl  and  NU  UDP2,  Share  distribution  No.  6l4 . 
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arithmetic  operations  would  be  decoded  according  to  a  set  of 
significance  rules.   Standard  floating  point  representation 
was  to  be  used,  except  that  each  number  was  allowed  to  have 
just  enough  leading  zeros  after  the  binary  point  so  that  all 
the  remaining  bits  are  alleged  to  be  significant  (with  possible 
exception  of  a  small  fixed  number  of  guard  digits,  at  the 
extreme  right,  to  absorb  the  accretion  of  rounding  errors). 
Addition  and  subtraction  were  to  be  unnormallzed.   Multipli- 
cation and  division  were  to  give  the  same  number  of  leading 
zeros  as  the  less  accurate  operand.   Rounding  was  to  take  place 
after  each  operation. 

If  the  floating  point  representation  of  a  number  Is 
+  2'^f,   where   f   Is  a  binary  fraction  of,  say,  27  bits  and 

0  ^  f  <  1,   we  must  suppose  that  the  true  value  Is   +2  (f+6f), 
where   5f   Is  the  error  caused  by  truncation  to  27  bits.   We 
shall  assume  that  normally   6f   Is  completely  unknown  and 
equally  likely  to  lie  anywhere  In  the  Interval   (0,2   ');  Its 
expected  value  would  be  obtained  by  giving  a  28   bit  the  value 

1  and  all  following  bits  the  value  zero.   In  particular,  this 
Is  assumed  to  be  the  case  (in  the  absence  of  more  Information) 
even  If  all  27  bits  of  the  fraction   f   are  zero;  If  such  a 
quantity  Is  used  subsequently  In  the  calculation.  It  Is  clearly 
Important  to  take  Into  account  that,  although  the  quantity  Is 
zero  to  the  best  of  our  knowledge.  It  still  has  an  error  of 

+  2"^5f. 

a)  Addition 

Addition  and  subtraction  were  easy  to  Implement  since  the 
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7090  already  had  the  instructions  Unnorraallzed  Floating  Add, 
UFA,  and  Unnormalized  Floating  Subtract,  UFS .   However,  we 
felt  that  it  was  important  that  automatic  rounding  take  place 
after  each  operation  in  the  significance  mode.   It  is  bad  enough 
that  the  7090  does  not  round  when  using  normalized  arithmetic 
where  presumably  the  last  few  bits  of  the  word  act  only  as 
"guard  bits,"  but  in  the  unnormalized  mode  without  many  "guard 
bits"  it  might  be  catastrophic  not  to  round.   Addition,  then, 
in  the  Significance  Mode,  is  done  as  an  unnormalized  floating 
add  followed  by  a  round. 

b )  Multiplication 

In  multiplication  we  wanted  the  product  to  be  only  as 
significant  as  the  less  accurate  operand.   That  is,  the  product 
should  have  as  many  leading  zeros  as  the  less  accurate  of  the 
multiplier  and  multiplicand,  with  the  exponent,  of  course, 
suitably  adjusted.   In  order  to  accomplish  this,  the  multiplier 
and  multiplicand  are  tested,  the  factor  with  the  larger  fraction 
(more  significant)  is  normalized  and  then  multiplication  is 
performed.   A  "shift  rule"  is  then  used  to  decide  whether  the 
product  should  be  shifted  left  once  more.   The  result  is  then 
rounded . 

The  shift  rule  is  necessary  if  the  product  is  not  to  be 
biased.   For  example,  consider  the  product  of:   .0101  x  .0010. 
We  would  like  the  product  to  have  two  leading  zeros  (the  number 
of  zeros  in  the  less  accurate  operand);  however,  after  normalizing 
the  more  accurate  and  multiplying  we  would  get   .0001010  (with 


suitable  exponent).   In  this  case  we  would  wish  to  have  a  left 
shift.   But  If  the  factors  were   .0111  x  .0011   we  would  get 
.0010101   which  has  the  correct  number  of  leading  zeros.   In 
the  latter  case  we  would  not  want  a  shift  because  there  was 
a  carry  in  the  product.   That  is,  we  have  a  normalized  number 
X,  1/2  ^x   <    1,      being  multiplied  by  a  number  y 2  ,    l/2  ^  y  <  1, 
with  n   leading  zeros,  clearly  the  product  will  have   n  +  1 
zeros  when  x^y  <  2~  .   This  means  that  in  the  product,  the  machine 
should  be  able  to  detect  whether  the  product  is  greater  than   I/2 
i.e.  whether  an  appropriate  carry  took  place.   This  we  were  informed 
was  not  possible,  since  the  existing  device  was  never  meant  to  de- 
tect such  carries  in  the  middle  of  a  register.   We  were  forced  to 
find  an  alternative  method.   Another  difficulty  we  faced  was  that 
the  timing  of  the  operation  permitted  examination  of  only  one  of 
the  two  factors  being  multiplied. 

We  decided  to  consider  the  factor  which  was  normalized  (the 
more  accurate  of  the  two).   This  number  x   lies  between   l/2 
and  1.   Statistically,  we  can  say  that  the  product,  in  this  context, 
will  probably  be  greater  than   l/2   if  x   is  close  to   1   and  it 
probably  will  be  less  than   I/2   if  x   is  close  to   1/2.   The 
exact  probabilities  will  of  course  depend  on  the  distribution 
of  the  factors.   The  rule  we  adopted  is  to  test  x  before  a 
multiplication.   If  x   is  less  than  3/^,      then  normalize  the 
product  one  step  if  possible.   (This  is  a  test  on  bit  10.) 

The  special  case  of  two  numbers  with  zero  fractions  is 
treated  as  if  one  factor  (the  multiplier)  had  been  normalized 
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by  inserting  a  normalization  bit  and  reducing  the  exponent 
suitably  before  multiplication.   (On  the  7090,  the  MQ,  factor 
had  its  exponent  reduced  by  27  and  a  one  forced  into  bit   9.) 
Since  true  zeros  are  the  exception  rather  than  the  rule  when 
in  the  Significance  Mode,  the  exponent  is  important  and  must 
be  retained  even  though  the  result  of  an  operation  may  have 
a  zero  fractional  part. 

c )  Division . 

We,  of  course,  wanted  the  quotient  to  have  the  same 
number  of  leading  zeros  as  the  less  accurate  of  the  operands. 
To  achieve  this,  the  divisor  is  always  normalized.   If  the 
dividend  is  less  accurate  than  the  divisor,  the  divisor  is 
normalized  and  the  division  proceeds . 

However,  if  the  divisor  is  less  accurate  than  the  dividend, 
the  dividend  ought  clearly  to  be  denormalized  some  before  di- 
viding.  This  is  accomplished  by  first  normalizing  the  dividend, 
then  simultaneously  normalizing  the  divisor  and  denormalizing 
the  dividend,  after  which  the  dividend  has  the  same  number  of 
leading  zeros  as  the  divisor  had  initially.   In  each  instance, 
it  is  necessary  to  decide  whether  to  shift  one  place  right  after 
division.   As  before,  if  we  now  let  the  numerator  have   n   leading 
zeros  and  be   y»2~  ,   where   l/2  ^  y   <    1 ,      and  let  x   be  the 
normalized  divisor,   l/2  ^  x  <  1,   the  quotient   y-2~  /x  will 
have  one  less  leading  zero  (be  one  bit  too  accurate)  when 
y/x  ^  1 .   Thus  about  half  the  time  a  shift  right  is  required  in 
order  not  to  bias  the  result  of  division.   Again  we  were  faced 
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with  the  Inability  of  the  existing  hardware  to  detect  this  and 
therefore  adopted  a  "shift  rule"  for  division  similar  to  the 
one  for  multiplication.   During  division,  the  normalized  di- 
visor is  tested;  if  it  is  less  than   3/4,   the  quotient  is 
shifted  one  additional  place  right  (in  actual  practice,  the 
division  is  stopped  one  cycle  sooner). 

A  problem  arose  in  attempting  to  have  an  automatic  round 
after  a  divide.   In  division,  of  course,  to  decide  whether  it 
is  necessary  to  round,  one  must  examine  the  remainder  and 
determine  whether  it  is  more  than  l/2  of  the  divisor.   It  was 
not  found  feasible  to  do  this  since  the  instruction  time  was 
too  short  to  allow  examination  of  the  remainder.   Instead,  after 
completing  a  division,  there  is  an  unconditional  trap  to  a  fixed 
location  where  a  small  subroutine  may  take  control  and  test  for 
rounding . 

4  .    Computer  Instructions. 

a)  ESM  --  Enter  Significance  Mode 


-0761 


0140 


S,l  11   12  25  35 

Description :   Execution  of  this  instruction  causes  the  computer 
to  enter  the  significance  mode  and  turn  on  the  significance 
mode  indicator  on  the  IBM  7151  Console.   This  instruction  is 
decoded  as  a  no-operation  on  a  7090  system  without  the  significance 
feature . 

Indicators :  Significance  Mode. 
Timing:  2  cycles . 
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Execution:  After  ESM  instruction  execution,  the  computer  system 
operates  in  this  mode  until  either  a  test  significance  mode  (TSM) 
instruction  is  executed  or  the  clear  or  reset  key  on  the  operator's 
console  is  depressed.   Since  the  address  part  of  this  instruction 
contains  part  of  the  operation  code,  address  modification  may 
change  the  operation  itself. 

b)  TSM  --  Test  Significance  Mode 


S,l  11   12  23  35 

Description:   If  the  significance  mode  indicator  is  on,  execution 
of  this  instruction  turns  it  off  and  causes  the  computer  to  skip 
the  next  sequential  instruction.   If  the  significance  mode  indicator 
is  off  when  a  TSM  instruction  is  executed,  the  next  sequential  in- 
struction is  executed.   A  TSM  instruction  is  decoded  as  a  no-operation 
on  a  7090  system  without  the  significance  feature. 
Indicators :  Significance  Mode. 
Timing:  2  cycles . 

Execution:  Since  the  address  part  of  this  instruction  contains  part 
of  the  operation  code,  address  modification  may  change  the  operation 
itself. 

When  the  709O  is  operating  in  the  significance  mode,  floating  point 
instructions  are  affected  as  follows: 
Instruction  Difference 

FAD      Decode  as  UFA  followed  by  a  rounding  operation 
(timing  increase  of  1  cycle  over  UFA). 
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FSB      Decode  as  UFS  followed  by  a  rounding  operation 

(timing  Increase  of  1  cycle  over  UFS) . 
FAM      Decode  as  UAM  followed  by  a  rounding  operation 

(timing  Increase  of  1  cycle  over  UAM) . 
FSM      Decode  as  USM  followed  by  a  rounding  operation 

(timing  Increase  of  1  cycle  over  USM). 

UFA, UFS, 

UAM,USM,FRN  Operate  In  a  normal  manner  without  timing  increase. 

PMP,UFM  Decode  as  UPM  with  the  following  changes: 

1.  The  fractional  parts  of  the  two  operands  are 
compared  to  see  which  is  the  larger;  the  larger 
fraction  is  then  normalized  and  the  characteristic 
is  accordingly  adjusted  before  multiplication. 

2.  If  both  fractions  are  zero,  the  multiplier  fraction 
is  forced  to  contain  a  one  in  position  nine  and  its 
characteristic  is  reduced  by  27-,  q  before  multiplication. 
Underflow  may  occur. 

3.  If  position  10  of  the  normalized  factor  contains  a  zero, 
the  full  length  product  is  normalized  one  step  and  the 
characteristic  is  reduced  by  one.   (This  normalization 
occurs  only  when  position  9  of  the  product  is  zero.) 

4.  The  product  is  rounded. 

5.  Underflow  or  overflow  trap  and  indications  remain  the 
same  as  with  a  standard  709O  System  (timing  increase 
of  from  0  to  5  cycles  over  standard  UFM) . 

FDPjFDH  Decode  as  usual  with  the  following  changes: 

1.  If  the  fractional  part  of  the  divisor  is  zero,  the 
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divide  check  indicator  is  turned  on.* 

2.  If  the  fractional  part  of  the  dividend  is  zero,  the 
quotient  contains  a  zero  fraction.   The  quotient 
characteristic  is  adjusted  by  the  count  necessary  to 
normalize  the  divisor  fraction.   The  quotient  fraction 
remains  zero. 

3.  If  neither  factor  is  zero,  the  fractional  parts  are 
compared  to  determine  the  larger  factor. 

h.    If  the  dividend  has  the  larger  fraction,  it  Is  normalized 
and  the  characteristic  adjusted  accordingly.   Otherwise, 
skip  to  step  6. 

5.  The  divisor  is  then  normalized  and  the  dividend  de- 
normalized  simultaneously.   Bits  leaving  dividend 
position  35  are  lost.   The  characteristic  is  adjusted 
according  to  the  number  of  shifts.   Now  proceed  to 
step  7 • 

6.  If  the  divisor  has  the  larger  fraction,  it  is  normalized 
and  the  quotient  characteristic  adjusted  accordingly. 

7-  If  position  ten  of  the  normalized  divisor  is  zero, 
one  less  reduction  cycle  than  in  standard  operation 
occurs . 

This  is  done  regardless  of  the  exponent.   If  the  divisor  is 
2  (f+6f)   where   f  =  0,   then  all  we  know  is  that  the  true 
value  of  the  divisor  lies  in  the  interval  (-2  ~   ,+2  ~   ). 
This  interval  contains  the  value  zero  and  many  other  values 
that  would  give  exponent  overflow  if  the  division  were  performed 
with  such  values. 
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8.  At  completion  of  a  significance  mode  division, 
an  unconditional  trap  occurs.   The  contents  of 
the  Instruction  counter  plus  one  are  stored  In 
the  address  portion  of  location  0000  and  the 
computer  executes  the  Instruction  located  at 
OOllo.   Underflow  or  overflow  resulting  from 

a  divide  operation  Is  flagged  In  the  decrement 
portion  of  location  0000,  as  In  normal  floating 
point  trap  operation  (timing  Increase  of  0-8 
cycles  over  standard  FDH) . 

9.  If  underflow  or  overflow  occurs,  the  trap  will 
be  to  location  OOlOg  only  as  In  normal  floating 
point  trap  operation. 

5,    A  note  on  the  Elementary  Functions. 

If  subroutines  for  the  elementary  functions  are  to  be 
Incorporated  In  a  calculation  In  the  Significance  Mode,  they 
should  accept  unnormallzed  numbers  and  give  the  right  number 
of  leading  zeros  In  the  result.   If  we  denote  by  "dig  (x)" 
the  number  of  significant  binary  digits  of  x   and  by   5x 
the  error  of  x,   then  we  can  represent 

X  =  f'2^,       where      5x  :^  2^"^^ 


and 


or 


— I  =  if  2^^ 
5x '    ' 


log„||-|  =  log^lf I  +  27 


=2'6x'  ~  ""^2 
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Let   f  =  f-,2  1^^^^,  i.e.  normalize   f   to   f-,   by  shifting 
left   lz(x)   places  (the  number  of  leading  zeros  in  f ) ;  then 


log2|f|  =  log^lf^l  -  lz(x) 


log 


X 

2'5x 


logglf^l  +  27  -  lz(x) 


Now 


1/2  1  |f^ 


log^lf-L  I  ^  0  and  ^  -  1 


Also,  since 


27  -  lz(x)  =  dig  (x), 

dig  X  >   log2l^l  1  dig  (x) 


-  1 


If   F   is  now  a  function  of  x   then 


dig  F(x)  -dig  (x)  7i   lo| 

In  particular,* 

a)   for  F(x)  =  \/x    , 


F(x) 
F' (x)5x 


log. 


5x 


log. 


F(x) 
xF' (xj 


F'  (x)  =  - 


2i/x 


dig  (F)  -  dig  (x)  Z    log2l(2)|  =  1 


b)   for   F(x)  -  e^, 


F'  (x)  =  e' 


dig  (F)  -  dig  (x)  ;:?  log^  I  l/x  |  =  -  log^lx 


c )   for  F(x)  =   In  X  , 


F' (x)  =  l/x 


dig  (F)  -  dig  (x)  ;:?  loggjln  x 


For  any  number  x,  logp|x|   is  approximately  the  exponent 


part  of  its  floating-point  format. 
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d)   for  F(x)  =  sin  x,  P'(x)  =  cos  x 


sm  X 


X  cos  X 


dig  (F)  -  dig  (x)  s;  log^ 

and  for  P(x)  =  cos  x 

dig  (F)  -  dig  (x)  -  log,    ^°^  ^ 


X  sm  X 

6 .    Use  of  the  Significance  Mode. 

The  significance  mode  makes  It  possible  to  run  existing 
programs  In  this  mode  or  In  the  conventional  mode.   Furthermore 
on  7090  machines  without  this  feature,  the  two  new  Instructions 
on  the  NYU  machine  will  be  Ignored  (decoded  as  NOP) . 

To  facilitate  the  use  of  this  mode,  a  subroutine  NU  ESM* 
Is  available  which,  when  used,  causes  the  machine  to  enter 
significance  mode   and  takes  care  of  the  unconditional  trap 
for  rounding  purposes  which  takes  place  after  a  floating 
divide  Instruction.   This  routine  also  has  entries  to  leave 
significance  mode  and  to  test  If  In  this  mode.   In  addition, 
subroutines  for  the  elementary  functions  which  handle  unnormallzed 
floating  point  numbers  and  give  the  right  number  of  leading  zeros 
In  the  results  (described  In  previous  section)  are  available,** 
as  well  as  a  format  for  Indicating  significance  In  printed  output. 

A  few  minor  pitfalls  were  encountered  In  running  existing 
programs  In  significance  mode.   Some  of  these  were: 

a)  A  common  way  to  convert  a  fixed  point  Integer  to  a 
floating  point  number  (in  Fortran  by  using  e.g.  A  =  I  or 
A  =  PL0ATF(I))   Is  to  use  a  floating  add  Instruction  (PAD). 

NYU-AEC  Computing  Center  wrlteup  O-61IOO9. 
**NYU-AEC  Computing  Center  wrlteups  0-620322,620323,620324,62032?. 
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This  floating  add  must  be  out  of  significance  mode. 

b)  In  using  logical  comparison  type  instructions  care 
should  be  taken  since  the  characteristic  (exponent)  of  an 
unnormallzed  number  is  not  a  measure  of  the  size  of  the 
floating  point  number. 

c)  In  the  significance  mode,  addition,  subtraction, 
multiplication  and  division  are  rounded  automatically,  not 
merely  truncated.   Therefore  existing  codes  using  floating 
round  instructions  (FRN)  following  certain  operations  had  to 
be  changed  in  order  not  to  get  a  double  round. 

7 .    Numerical  Experiments  using  Significance  Mode, 
a)  Sine  Series 

i)Let  X  =    [jr  +  ^  n) 

and  compute 

x^   x5 
sin  X  =  X  -  -^-j-  +  -TT   +  ■  '  • 
^  •    1?  • 

until  the  one  word  sum  is  not  changed.   As   n  gets  larger, 
this  oscillating  series  gives  worse  and  worse  results  which 
of  course  the  standard  normalized  mode  doesn't  indicate. 
This  table  presents  a  comparison  of  the  results  obtained  using 
this  series  in  the  normalized  mode,  and  in  the  significance 
mode,  and  in  the  manner  Lewis  described  in  NYO-2881 ,   The 
number  of  bits  alleged  to  be  accurate  is  shown.   In  addition, 
the  number  of  terms  in  the  series  required  is  also  shown. 


-  1^ 


SINE  SERIES 


sin  (J  +  I  n) 


7090 

n    Normalized  Mode 

0  26  bits 

1  25 

2  25 

3  22 

4  21 

5  19 

6  19 

7  l4 

8  11 

9  11 

10  7 

11  5 

12  6 
15         1 


(1) 


NYO- 
2881 

(2) 

bits 

7090 
Sig.  Mode 

27  bit 

(2) 
s 

No. 

in 

of  Terms 
Series 

26 

6  terms 

23 

25 

9 

21 

23 

11 

20 

20 

13 

17 

17 

15 

15 

16 

17 

13 

13 

18 

12 

12 

20 

8 

9 

21 

8 

6 

22 

5 

5 

24 

3 

3 

26 

1 

0 

25 

0 

1 

27 

(1) 


(2) 


Number  of  correct  bits  in  the  normalized  result  of  the  27  given. 
Number  of  bits  after  the  leading  zeros  (all  of  which  were  correct) 
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11 )  sin  X,  where  x   is  given  In  degrees.  Is  computed 
by  the  series  as  In  (l)  using  significance  mode  and  the 
results  compared  with  Rlchtmyer's  results  on  the  Unlvac 
given  In  NYO-7973.   (One  decimal  ;:;>  3-3  bits.) 

SINE  SERIES 


sin  X 

x° 

NYO-7973        n 

7090 
Slg.    Mode 

27   bits 

No.    of   Terms 

Slg.    De 
9 

clmals 
decimals 

In  Series 

0 

2    terras 

10 

9 

27 

5 

20 

9 

27 

5 

30 

9 

26 

6 

40 

9 

27 

6 

50 

9 

27 

7 

60 

8 

26 

7 

70 

8 

26 

80 

8 

26 

8 

90 

8 

27 

8 

500 

5 

16 

17 

510 

5 

16 

17 

520 

5 

14 

17 

530 

4 

13 

17 

540 

4 

13 

18 

550 

4 

13 

18 

560 

4 

13 

18 

1010 

1 

3 

25 

1020 

1 

3 

25 

1030 

1 

3 

25 

1040 

1 

3 

26 

1050 

1 

3 

26 

1500 

0 

0 

32 

1510 

0 

0 

32 

Number  of  correct  decimals  using  significance  rules  In  NYO-7973- 
Number  of  bits  after  the  leading  zeros  (all  of  which  were  correct) 
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b)  Solution  of  Linear  Equations 

The  significance  mode  can  be  useful  in  determining  the 

accuracy  of  the  solution  of  linear  equations.   If  we  attempt 

to  solve*  the  following  simple  system  of  4  equations  in  4 

unknowns,  the  standard  normalized  mode  gives  only  about  4  to 

5  significant  figures  which  are  determined  by  comparison  with 

the  known  correct  result.   However,  in  general,  the  number  of 

correct  digits  will  be  unknown.   For  example,  substituting 

the  results  back  in  the  equations  is,  as  Is  well  known,  of 

no  help.   The  significance  mode  gives  the  correct  number  of 

significant  figures. 

AX  =  B 
A 

2.8i      4.54  ^ 
-2.02       5.1398 


^3.21 
3.2 


1.1102 
1.2598 


1.0201   -1.3501 


^-2.0298 


Correct 

-1 

-2 

-1 

1 


2.55 


3.1 
-1.3702 


-2.1201 
3.64 


X 


B 


Results  Using 
7090 
Normalized  Mode 

-1.000  083  3 

-2.000  216  1 

-  .999  993  18 
1.000  107  5 


y 

AX  -  B 

with 
These  Results 

-0.000  000  1 

-0.000  000  1 

0.000  000  0 

-0.000  000  0 


3 


^-3.700^ 

1.4402 

-3.54 
1.94 


y 


Results  Using 
Slg.  Mode 


-  0.0001000  (4) 

-  0.0002000  (4) 

-  0.0010000  (3) 

+  0.0001000  (4) 


NU  LEQ,  A  Gauss  Elimination  Method  was  used,  NYU-AEC 
Computing  Center  Writeup  O-61IOO6,  or  SHARE  Dlst.  No.  I263. 
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c)  Determinant  Evaluation 

A  comparison  Is  made  for  a  given  n  --  between  the  correct 
value  of  the  determinant  of  the  Hubert  Matrix,  the  7090  normal- 
ized value,  and  the  significance  mode  value. 

DETERMINANT  EVALUATION 
Hubert  matrices    h. 


1 


ij    i+J-1 


A 


n-1 

(TTkl)' 
k=l 


n       n-1    „   „    , 

n  -r r  (    2  ,  2\n-k 

n   I   I  [n  -k  j 

k=l 


n 

Det  H  Correct   De 

^t  H  7090  Norma 

llzed 

2 

8.353  333  3  (-  2) 

8.333  333  i 

(-  2) 

3 

4.629  629  6  (-  4) 

4.629  626  2 

(-  4.^ 

4 

1.653  ^39  2  (-  7) 

1 .653  4l_4  6 

(-  7) 

5 

5.749  295  1  (-12) 

3.747  448  7 

(-12) 

6 

5.367  ...    (-18) 

5.335  337  1 

(~18) 

7 

4.836  ...    (-25) 

4.991  732  1 

(-25) 

Det  H  Sig.  Mode 


0.833  333  3 
0.046  296  2 
0.001  653  4 
0.000  037  5 
0.000  000  6 
0.000  000  0 


-  1) 

-  2) 

-  4) 

-  7) 
-11^ 
-17) 


d)  Matrix  Inversion 

1)  A  comparison  of  the  elements  of  the  inverse  for 
the  ill-conditioned  Hilbert  matrix  using  normalized  and 
significance  mode  arithmetic. 

11 )  A  comparison  of  the  elements  of  the  inverse  of 
a  well-conditioned  matrix  (Gamma  matrix). 
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MATRIX   INVERSION 


Hubert  Matrix 


h.  .  =  . ,  .  , 

Elements  of  Inverse,   c.  . 

3 


n 

Rang( 

s   of 

C  .  . 

Normalized 

Slg.  Mode 

Slg.  Mode. 

(dec 

ilmals ) 

(bits) 

(decimals ) 

2 

1-; 

10^ 

7 

25 

7 

3 

lO^. 

-102 

7 

19 

6 

4 

loi. 

-105 

6 

14 

4 

5 

loi. 

-105 

5 

9 

3 

6 

loi. 

-106 

4 

5 

2 

7 

loi. 

-108 

2 

4 

1 

B:   Gamma  Matrl:x 

^IJ 

"  ^jl 

i(n+l-j) 
n+l 

i  1  J 

^ 

1 
n+l 

^11 

=  2 

c .  . 

=  -  1  , 

i-J   =  1 

c .  . 

=  0    , 

i-J  >   1 

n 

Normalized 

h 
Slg.  Mode. 

(dei 

3imal£ 

;) 

(decimals) 

40 

6 

6 

^0 

6 

6 

100 

5 

5 

Number  of  correct  decimals  in  the  normalized  result. 

Number  of  bits  after  the  leading  zeros  (all  of  which  were  correct) 

Binary  to  decimal  conversion  of  results  used  in  ft.  2. 

Number  of  correct  decimals  in  the  Slg.  Mode.   Not  all  bits  after 

the  leading  zeros,  before  conversion,  v;ere  correct. 
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e)  Runge-Kutta. 

In  solving  the  differential  equation 

dy/dx  =  cos  x      with  Initial  conditions   x  =  0,  y  =  0, 
by  the  Gill  modification  of  Runge-Kutta*  we  found  the  results  for 
y   In  standard  and  significance  mode  as  shown.   The  significance 
mode  indicates  a  loss  of  significance  which  is  unwarranted. 

Preliminary  investigations  seem  to  indicate  that  certain 

constants  entering  in  the  Glll-Runge  Kutta  formulas  are  affecting 

the  results.   The  formulas   may  be  written  as  follows: 

k.  =  h  y^  1   =   0,1,2,3 

r.,.  =  D. (k.-E.q. ) 
1+1     1   1   1^1 

^1+1  =  ^1  +  ^1+1 

^1+1  -   ^1^1  ■"   5^1+1  +  ^1 

The  constant   Dp   Is   1  +  p-  y/2      and  the  constant  F        is   -Dp  • 
It  seems  these  constants  are  causing  our  shift  rule  on  multipli- 
cation to  be  biased.   If  we  split  both  these  constants  into  two 
parts,   1   and  p-  i/2    ,      and  do  the  distributed  multiplications 
Instead,  we  obtained  the  results  given  in  the  column  labelled 
"modified, " 


NU  DEQ  was  used,  NYU-AEC  Computing  Center  Writeup  O-61IOO6, 
or  SHARE  Dlst.  No.  I2O5. 
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Runge-Kutta 


X 


dy/dx  =  cos  x   I.C 


X  =  0 

y  -  0 


=  n/8 


n 

7090 

Normalized  Mode 

Significance 

Mode 

(Modified)Sig.  Mode 

y 

y 

y 

1 

23  bits 

23  bi 

ts 

23  bits 

2 

23 

23 

23 

3 

24 

23 

23 

4 

23 

23 

23 

5 

24 

23 

23 

6 

24 

22 

23 

7 

24 

21 

23 

8 

23 

19 

23 

9 

23 

18 

23 

10 

23 

17 

23 

11 

23 

17 

24 

12 

23 

15 

23 

13 

23 

14 

23 

14 

23 

13 

23 

15 

23 

12 

22 

16 

23 

11 

22 

17 

23 

10 

22 

18 

23 

9 

22 

19 

23 

8 

21 

20 

23 

6 

21 

21 

23 

4 

20 

22 

23 

1 

20 

Number  of  correct  bits  in  the  normalized  result 

Number  of  correct  bits  in  the  Significance  Mode. 

For  small  n,  the  results  gave  about  3  more  bits  after 

the  leading  zeros  which  were  not  correct. 
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8,    Conclusions . 

The  foregoing  examples  seem  to  justify,  at  least  tentatively, 
the  following  conclusions. 

(1)  In  calculations  of  the  kind  exemplified,  except  for  the 
unmodified  Runge-Kutta,  the  number  of  bits  carried  in  the  sig- 
nificance mode  is  very  close  to  the  number  of  correct  bits  in 
the  normalized  mode,  and  most  of  the  bits  carried  are  in  fact 
correct . 

(2)  If  the  original  rules  had  been  used,  as  was  done  by  Lewis 
and  by  Metropolis,  then  a  product  or  quotient  would  have  exactly 
the  same  number  of  leading  zeros  as  the  less  accurate  operand. 
Then  the  unmodified  Runge-Kutta  would  have  been  satisfactory, 

(3)  The  ajnplif ication  of  errors  by  accretion  of  rounding  errors 
does  not  seem  to  have  been  serious  in  any  of  the  examples  tried 
so  far. 

(^)   In  a  few  cases,  there  appears  to  have  been  a  slight  ad- 
ditional loss  of  significance  caused  by  the  use  of  unnormalized 
numbers.   However,  if  a  problem  is  such  that  (or  the  machine's 
word  is  long  enough  that)  all  input  data  can  be  supplied  with 
6-8  guard  bits  at  least  for-  problems  of  the  moderate  size  con- 
sidered here,  beyond  the  ones  regarded  as  significant,  no  true 
loss  of  significance  will  occur. 
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